Reinforcement learning

Results: 1147



#Item
221Mathematical optimization / Operations research / Dynamic programming / Markov processes / Stochastic control / Markov decision process / Reinforcement learning / Convolution / Optimal control / Q-learning

MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Truncated Approximate Dynamic Programming with Task-Dependent Terminal Value Farahmand, A.-M.; Nikovski, D.N.; Igarashi, Y.; Konaka, H.

Add to Reading List

Source URL: www.merl.com

Language: English - Date: 2016-04-13 11:53:14
222Dynamic programming / Markov processes / Stochastic control / Cognitive science / Probability theory / Markov decision process / Reinforcement learning / Theory of mind / Probability / Psychology / Cognition

Cognitive Science–618 Copyright © 2014 Cognitive Science Society, Inc. All rights reserved. ISSN: printonline DOI: cogsInferring Learners’ Knowledge From Their Ac

Add to Reading List

Source URL: cocosci.berkeley.edu

Language: English - Date: 2015-10-08 17:16:40
223

Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu

Add to Reading List

Source URL: koray.kavukcuoglu.org

Language: English - Date: 2016-06-17 02:02:41
    224

    Journal of Machine Learning Research–1105 Submitted 2/05; Published 6/06 Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems∗

    Add to Reading List

    Source URL: www.jmlr.org

    Language: English - Date: 2006-06-22 16:25:37
      225Artificial neural networks / Cybernetics / Applied mathematics / Machine learning / Recurrent neural network / Backpropagation / Jrgen Schmidhuber / Deep learning / Types of artificial neural networks

      Evolving Large-Scale Neural Networks for Vision-Based Reinforcement Learning Jan Koutník Giuseppe Cuccu

      Add to Reading List

      Source URL: people.idsia.ch

      Language: English - Date: 2013-08-16 10:17:44
      226Dynamic programming / Markov processes / Stochastic control / Belief revision / Reinforcement learning / Markov decision process / Probability distribution

      Targeting Specific Distributions of Trajectories in MDPs∗ David L. Roberts1 , Mark J. Nelson1 , Charles L. Isbell, Jr.1 , Michael Mateas1 , Michael L. Littman2 1 2

      Add to Reading List

      Source URL: www.kmjn.org

      Language: English
      227

      Journal of Machine Learning Research1578 Submitted 11/13; Revised 11/14; Published 8/15 RLPy: A Value-Function-Based Reinforcement Learning Framework for Education and Research

      Add to Reading List

      Source URL: jmlr.org

      Language: English - Date: 2015-10-01 08:22:29
        228

        Training Factor Graphs with Reinforcement Learning For Efficient MAP Inference Michael Wick, Khashayar Rohanimanesh, Sameer Singh, and Andrew McCallum University of Massachusetts Computer Science Department (IESL) Proba

        Add to Reading List

        Source URL: people.cs.umass.edu

        - Date: 2009-11-06 19:58:51
          229

          Journal of Arti cial Intelligence Research Submitted 9/95; published 5/96 Reinforcement Learning: A Survey Leslie Pack Kaelbling

          Add to Reading List

          Source URL: www.jair.org

          Language: English - Date: 2006-02-07 22:52:16
            230

            Playing Tetris with Deep Reinforcement Learning Matt Stevens Sabeek Pradhan

            Add to Reading List

            Source URL: cs231n.stanford.edu

            Language: English - Date: 2016-03-23 18:05:40
              UPDATE